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ABSTRACT 

Clustered regularly interspaced short palindromic 
repeats (CRISPR), in combination with CRISPR as- 
sociated (cas) genes, constitute CRISPR-Cas bac- 
terial adaptive immune systems. To generate im- 
munity, these systems acquire short sequences of 
nucleic acids from foreign invaders and incorpo- 
rate these into their CRISPR arrays as spacers. 
This adaptation process is the least characterized 
step in CRISPR-Cas immunity. Here, we used Pec- 
tobacterium atrosepticum to investigate adaptation 
in Type l-F CRISPR-Cas systems. Pre-existing spac- 
ers that matched plasmids stimulated hyperactive 
primed acquisition and resulted in the incorporation 
of up to nine new spacers across all three native 
CRISPR arrays. Endogenous expression of the cas 
genes was sufficient, yet required, for priming. The 
new spacers inhibited conjugation and transforma- 
tion, and interference was enhanced with increas- 
ing numbers of new spacers. We analyzed -^350 new 
spacers acquired in priming events and identified a 
5'-protospacer-GG-3' protospacer adjacent motif. In 
contrast to priming in Type l-E systems, new spac- 
ers matched either plasmid strand and a biased dis- 
tribution, including clustering near the primed pro- 
tospacer, suggested a bi-directional translocation 
model for the Cas1:Cas2-3 adaptation machinery. 
Taken together these results indicate priming adap- 
tation occurs in different CRISPR-Cas systems, that 
it can be highly active in wild-type strains and that 
the underlying mechanisms vary. 



INTRODUCTION 

CRISPR-Cas (clustered regularly interspaced short palin- 
dromic repeats-CRISPR-associated proteins) is a small 
RNA-based prokaryotic defense mechanism that functions 
against a wide range of mobile genetic elements (MGEs) 
(for recent reviews see (1-6)). Immunity against MGEs 
is provided through a sequence-specific adaptive memory. 
CRISPR arrays are characterized by short repeats sepa- 
rated by similarly sized sequences (spacers), which are de- 
rived from invading MGEs. The mechanism of CRISPR in- 
terference involves three phases. During adaptation, short 
sequences derived from invader nucleic acids are incorpo- 
rated as new spacers into the CRISPR array. Next, the ex- 
pression stage results in the transcription of the CRISPRs 
from a promoter in a leader sequence upstream of the 
CRISPR arrays. These pre-crRNAs are processed into 
small RNAs (crRN As) by endonucleolytic Cas proteins. Fi- 
nally, in the interference step, Cas protein(s) and crRNAs 
form ribonucleoprotein complexes that target and degrade 
invading nucleic acids, which possess complementarity to 
the spacer portion of the crRNA. CRISPR-Cas systems are 
currently classified into three major Types (I-III), which are 
characterized by signature proteins (Cas3, Cas9 and CaslO, 
respectively) and further subdivided into subtypes, based on 
subtype-specific proteins (7). Considerable advances in un- 
derstanding the expression and interference phases in dif- 
ferent CRISPR-Cas systems have been made. In contrast, 
despite some knowledge about the adaptation phase, par- 
ticularly in the Type I-E system (8-15), Httle is known about 
adaptation in the Type I-F systems (reviewed in (16)). 

The acquisition of spacers upon an encounter with new 
foreign genetic elements is the prerequisite for develop- 
ing CRISPR-Cas resistance against this invader. This was 
first observed for the Type II system of Streptococcus ther- 
mophilus, where new phage-derived spacers were detected 
in CRISPR arrays following the isolation of phage resistant 
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strains (17). Spacers were acquired at the leader end of the 
CRISPR array (17) and this has been observed in subse- 
quent experiments in other systems (12,18-20). Unusually, 
in Sulfolobus solfataricus P2, internal spacer acquisition was 
observed specifically in one of the six CRISPR loci (19). Im- 
portantly, adaptation can occur through two different path- 
ways, termed naive and priming, which will be discussed be- 
low (16). 

Genetic experiments in Escherichia coli have begun to 
provide insight into the process of CRISPR adaptation 
in the Type I-E system (8-15). Only Casl and Cas2 
are conserved among all CRISPR-Cas types and are re- 
quired for new spacer incorporation in the Type I-E sys- 
tem (8,9,12). Recognition, and presumably cleavage, of 
the invader genome to generate new spacers involves de- 
tection of short motifs adjacent to the precursor spacer, 
which are termed protospacer adjacent motifs (PAMs; or 
spacer acquisition motifs) (9,12,21,22). Protospacers are 
the targeted/complementary strand of crRNAs and are de- 
noted in the 5'-3' direction (5,23)). The new spacer is then 
integrated at the leader-proximal end of the CRISPR, re- 
sulting in duplication of the first repeat (8,1 1,12). 

In the arms race between MGEs and their hosts, the in- 
vaders can 'escape' CRISPR-Cas immunity with point mu- 
tations that disrupt target recognition (8,24). Surprisingly, 
recent studies in Type I-E systems revealed that a second 
adaptation stage, termed priming, enables rapid adaptation 
to these escapees (8,10,11,15). In priming, invading escape 
MGEs are not immediately targeted, but trigger the accel- 
erated incorporation of new spacers, ultimately resulting in 
interference (8,11). Multiple spacers can be incorporated, 
providing increased resistance and reducing the probabil- 
ity of further escape (11). In contrast to naive adaptation, 
where Casl and Cas2 are the only Cas proteins required 
for spacer integration in Type I-E systems (12), priming 
needs Casl, Cas2, Cas3, crRNA and Cascade (CRISPR- 
associated complex for antiviral defense) (8). In Type II- 
A and II-B systems, in addition to Casl and Cas2, Csn2 
is required (17), and Cas4 is predicted to be involved, in 
the acquisition process (7). Priming in E. coli results in a 
spacer strand bias, whereby new spacers target the same 
strand as the original priming spacer, but the underlying 
mechanism is unknown (8,10,11,15). In contrast, priming 
was recently observed in the Type I-B system of Haloarcula 
hispanica, but new spacers targeted either strand of the in- 
vader DNA (20). Recently, we discovered that adaptation in 
the Type I-E system is primed by pre-existing spacers, even 
when they mismatch the invader by as much as 1 3 nt across 
the 35 nt PAM-protospacer sequence (15). A recent analysis 
of the PAM requirements for priming in a Type I-B system 
(25) confirmed our results that multiple PAMs can promote 
priming (15). This demonstrates that CRISPR-Cas systems 
are robust in their ability to restore resistance to escaping in- 
vaders, and suggests that resistance is generated more easily 
against closely related genetic elements that have not been 
previously encountered ( 1 5). This supports the concept that 
priming is the major contributor to adaptation when com- 
pared to naive acquisition. 

In contrast to the Type I-E system, there is limited knowl- 
edge about acquisition in Type I-F systems. With the excep- 
tion of a report where a few single spacers (seven unique) 



were incorporated (18), no information is available about 
adaptation in Type I-F systems. The I-F subtype is unique 
for studying acquisition as it is the only system in which 
the universal Cas2 and the signature Cas3 protein are fused 
as a hybrid-protein (Cas2-3) (7,26). We previously demon- 
strated that Cas2-3 interacts with Casl, suggesting that a 
Casl:Cas2-3 adaptation complex engages in spacer incor- 
poration (26). Here, to investigate acquisition in the Type 
I-F systems, we used the potato pathogen Pectobacterium 
atrosepticum (27). We have previously shown that this strain 
contains a single Type I-F cas operon and three CRISPR ar- 
rays that are transcribed and processed by Cas6f (Csy4) to 
yield crRNAs (28). The Csyl, Csy2, Csy3 and Cas6f pro- 
teins form an interference complex (26,29) that can inter- 
act with Cas2-3 (26). In this study, we detected hyperactive 
priming adaptation, with up to nine new spacers in a single 
strain and all three CRISPR arrays were active for incorpo- 
ration, albeit with different efficiencies. Type I-F priming re- 
sulted in a biased spacer acquisition pattern with new spac- 
ers complementary to both strands. The protospacer distri- 
bution supports a translocation model of localized spacer 
acquisition in Type I-F systems that is distinct from the 
strand-specific mechanism utilized by the Type I-E systems. 

MATERIALS AND METHODS 

Bacterial strains and growth conditions 

All strains and plasmids used in this study are given in Sup- 
plementary Table SI and details of their construction pro- 
vided in Supplementary Materials and Methods. P. atrosep- 
ticum SCRI1043 (30) was grown at 25°C and E. coli at 
37° C in Luria Broth (LB) at 180 rpm or on LB-agar (LBA) 
plates containing 1.5% (w/v) agar. When required, me- 
dia were supplemented with the following: ampicillin (Ap; 
100 |xg/ml), chloramphenicol (Cm; 25 |jLg/ml), kanamycin 
(Km; 50 |xg/ml), tetracycHne (Tc; 10 |xg/ ml) and D-glucose 
(0.2% w/v). Bacterial growth was measured in a Jenway 
6300 spectrophotometer at 600 nm (ODeoo)- 

Molecular biology and DNA sequencing 

Oligonucleotides were from Invitrogen or IDT and are 
listed in Supplementary Table S2. All strains and plasmids 
were confirmed by polymerase chain reaction (PCR) and 
DNA sequencing was performed at the Allan Wilson Cen- 
tre, New Zealand. Plasmid DNA was prepared using Zyppy 
Plasmid Miniprep Kits (Zymo Research). DNA from PCR 
and agarose gels was purified using the GE Healthcare II- 
lustra GFX PCR DNA and Gel Band Purification Kit. Re- 
striction enzymes and T4 DNA ligase were from Roche or 
NEB. 

Priming assays 

Five millilitres cultures of P. atrosepticum AHAI2 with 
pTRB30 (vector control) or pPF189 were grown overnight 
without antibiotic selection (Figure 1). Note that 10 |xl 
were used to inoculate a fresh overnight culture and dilu- 
tions were plated onto LBA. This was repeated over 5 days 
and performed in triphcate. Colonies (100) from each repli- 
cate were patched onto LBA ± Km. Km sensitive (Km^) 
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Figure 1. A pre-existing spacer:protospacer match accelerates Cas-dependent plasmid loss. (A) P. atrosepticum contains a Type I-F CRISPR-Cas system 
composed of three CRISPR arrays (1-3; gray arrows) and an operon of 6 cas genes (colored arrows). CRISPR2 consists of 10 spacers and spacer 6 (from 
leader proximal end; blue) perfectly matches a protospacer (red) in eca0560 in the chromosomal island HAI2, but has a TG PAM variant. (B) Schematic of 
the plasmid loss assays. P. atrosepticum AHAI2 carrying pTRB30 (control) or pPF189 (eca0560 primed; depicted) plasmids were grown without selection 
for 5 days and plasmid loss was scored by replica-plating on non-selective (NS) and selective (S) media. (C) Plasmid loss of a control plasmid (pTRB30) 
and the primed plasmid (pPF189) over 5 days when cultured in AHAI2 or AHAI2A«w backgrounds. Data shown are the mean ± SD of three biological 
replicates. 



colonies were screened by PCR for new spacers as described 
later. 

P. atrosepticum containing plasmids pPF571 (vector con- 
trol), pPF574 and pPF575 (priming vectors with proto- 
spacer 1 from the CRISPR 1 array with a mutated PAM in 
F and R orientations) were grown overnight in 5 ml LB and 
passaged for 5 days by transfer of 10 |jl1 to 5 ml fresh LB. 
Additionally, dilutions were plated on LBA + 1 mM Iso- 
propyl (B-D-l-thiogalactopyranoside (IPTG) for mCherry 
induction. White colonies were screened for CRISPR ex- 
pansion and sequenced as outlined below. 

CRISPR PCR and sequence analysis 

Colonies displaying plasmid loss were screened by colony 
PCR using primers for CRISPRl (PF174 and PF175), 
CRISPR2 (PF176 and PF177) or CRISPR3 (PF178 and 
PF179). The resulting products were separated on 2% 
agarose gels, purified and sequenced using PF175, PF177 or 
PF179 for CRISPR arrays 1, 2 or 3, respectively. Sequences 
were analyzed using CRISPRFinder (31), spacer sequences 
were extracted and assembled against target plasmids using 
GeneiousT^ and CRISPRTarget (23) to define the proto- 
spacer location, target strand and PAM. 

Transformation and conjugation assays 

Electrocompetent P. atrosepticum cells were prepared as 
described previously (32). For transformations, 50 ng of 
DNA was added to 50 |xl of competent cells, incubated on 
ice for 10 min then electroporated (1 mm electro-cuvettes, 
1800 V, capacitance 25 |jlF and resistance 200 ohms). Bac- 
teria were recovered in 1 ml LB for 2 h at 25°C and then 
plated on LB containing the appropriate supplements and 
grown at 25°C. Transformation efficiency was calculated as 
transformants/ng of DNA and normalized to non-targeted 
plasmid controls. 



For conjugation, the tested plasmids were transformed 
into E. coliSn-1 Xpir. Donor (E. co// S17-1 Xpir with tested 
plasmids) and recipient strains were grown overnight in LB 
with the appropriate antibiotics. The ODgoo was adjusted to 
1 and cells washed twice with LB. The donor and recipient 
strains were mixed (1:1 ratio), 5 |xl of the mixture spotted on 
0.2 |xm filters (Millipore) on LBA and incubated overnight. 
Cells were resuspended in 2 ml phosphate buffered saline 
by vortexing the filters and dilution series were plated on 
LBA (total cells), LBA + Sp (donors) and glucose minimal 
medium + Km (transconjugants). The efficiency of conju- 
gation was calculated as transconjugants per recipients. 

RESULTS 

A pre-existing spacer:protospacer match accelerates Cas- 
dependent plasmid loss 

To test whether priming was a general CRISPR phe- 
nomenon, we examined if the Type I-F system could acquire 
new spacers from a foreign element when an existing spacer 
was present. Previously, we have shown that P. atrosepticum 
contains a native spacer within CRISPR2 (termed C2S6; 
CRISPR2 Spacer 6) that has perfect complementarity to 
a chromosomal gene (eca0560) inside an integrative and 
conjugative element termed horizontally acquired island 2 
(HAI2) (28). We previously demonstrated that this spacer 
was non-functional for interference due to a variant PAM 
(5'-protospacer-TG-3') (32,33) (Figure lA). The lack of in- 
terference with C2S6 led us to query whether this -1 PAM 
point mutation would trigger priming. 

To test whether a native spacer in the chromosomal Type 
I-F array could promote spacer acquisition, we used strain 
AHAI2 (to prevent self-targeting due to priming), in which 
HAI2, including eca0560, was fully deleted, and introduced 
a control plasmid or a plasmid containing eca0560. Cul- 
tures were grown for 5 days and plasmid loss was deter- 
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mined (Figure IB and C). The control plasmid was sta- 
ble, whereas the plasmid with the protospacer (eca0560) 
was progressively lost over the time course, with 75% loss 
by day 5 (Figure IC). To test the contribution of the Cas 
proteins, the experiment was also performed in an isogenic 
cas operon deletion strain (AHAI2Acfl^). No plasmid loss 
was detected in the absence of the cas genes (Figure IC), 
confirming that plasmid curing driven by a pre-existing 
spacer;protospacer match was a Cas-dependent process. 

Plasmid loss is accompanied by spacer incorporation in all 
CRISPR arrays 

To confirm that plasmid loss was CRISPR-dependent, ex- 
pansion of the three native CRISPR arrays (CRISPR 1, 
2 and 3) was assessed (Figure 2A). Forty plasmid in- 
terfering mutants (PIMs) from independent experiments 
(to reduce siblings) from days 2 and 3 were checked by 
PCR, and 37/40 (93%) had at least one new spacer (Sup- 
plementary Table S3), whereas ~7% were the result of 
CRISPR-independent plasmid loss. The CRISPR arrays of 
37 CRISPR-dependent PIMs were sequenced and the ac- 
quired spacers were analyzed, revealing 105 new spacers 
(Supplementary Table S3). Sequencing the arrays revealed 
that all of the PIMs were unique and there were 68 spacers 
in CRISPRl (65%), 34 spacers in CRISPR2 (32%) and 3 
spacers in CRISPR3 (<3%) (Figure 2B). When analyzed for 
incorporation activity per array in each PIM, one acquisi- 
tion event was the most common and fewer PIMs were ob- 
served with increasing numbers of newly-acquired spacers 
per array (Figure 2C). Notably, CRISPRl and CRISPR2 
had a similar activity for acquiring the first and second spac- 
ers, but CRISPRl was more active for incorporating mul- 
tiple spacers and only single, independent incorporations 
occurred in CRISPR3 (Figure 2C). There was considerable 
variation in the total number of spacers acquired across all 
CRISPR arrays by each strain, with 2 spacers the most fre- 
quent number incorporated (Figure 2D). Remarkably, one 
strain (PIM 18) incorporated a total of 9 new spacers, 7 in 
CRISPRl and 2 in CRISPR2 (Figure 2D and Supplemen- 
tary Table S3). Of the PIMs with new spacers, most (81%) 
have at least one spacer in CRISPRl, 59% have at least one 
spacer in CRISPR2 and 8% have a spacer in CRISPR3 (Fig- 
ure 2E). The array activity per PIM is outlined in Figure 2E 
(i.e. 38% of PIMs have new spacers only in CRISPRl, yet 
35% have new spacers in both CRISPRl and CRISPR2). 
This demonstrates that all arrays are proficient for inte- 
gration and suggests that the order of incorporation activ- 
ity of these arrays is CRISPRl > CRISPR2 > CRISPR3. 
These data are highly consistent with the number of spac- 
ers present in the wild type (WT) strain (28, 10 and 3, re- 
spectively (27)) (Supplementary Figure SI) and promoter 
expression data, which showed a similar trend (28). The dif- 
ferent spacer acquisition efficiencies might be the result of 
repeat or leader variations. The consensus sequence of the 
CRISPRl and CRISPR2 repeats are the same, whereas the 
CRISPR3 repeats differ slightly (27). In addition, the lead- 
ers of each array are distinct, despite sharing some regions 
in common (data not shown). In conclusion, enhanced ac- 
quisition of nascent spacers occurs in native Type I-F sys- 
tems (i.e. without cas gene overexpression) when a pre- 
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Figure 2. Plasmid loss is accompanied by spacer incorporation in all 
CRISPR arrays. (A) Schematic of the spacer acquisition PCR screen (R, 
repeat and S, spacer). Forward primers specific to each CRISPR array (1- 
3) were used with reverse primers specific to the second spacer in each 
CRISPR and products separated on a gel (example shown). (B) Percentage 
distribution of the total number of new spacers in each CRISPR array. (C) 
Number of CRISPR arrays containing a particular number of new spac- 
ers. (D) Number of PIMs containing a total number of new spacers across 
all CRISPR arrays. (E) Percentage distribution of PIMs containing addi- 
tional spacers in particular CRISPR arrays (e.g. 35% of PIMs contain new 
spacers only in both CRISPRl and 2). 



existing interference-deficient spacer matches an MGE, but 
lacks a consensus PAM (see below). 



New spacers inhibit transformation and conjugation 

We hypothesized that the acquired spacers caused plas- 
mid loss. However, plasmid interference in the P. atrosep- 
ticum Type I-F system has not been demonstrated. To con- 
firm the spacers acquired were responsible for plasmid loss, 
we tested the transformation efficiency of PIMs contain- 
ing a single spacer in CRISPRl or CRISPR2 that targeted 
eca0560. Control or targeted (eca056O) plasmids were used. 
No PIM contained a new spacer exclusively in CRISPR3. 
Therefore, we transformed PIM23, which contains sin- 
gle new spacers in both CRISPRl and CRISPR3, with a 
plasmid containing the protospacer targeted by the new 
CRISPR3 spacer, but lacking the protospacer for the new 
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Figure 3. New spacers inhibit transformation and conjugation. (A). Transformation efficiency of strains with either no new spacers (AHAI2) or a single 
new spacer in CRISPRl (PIM06), CRISPR2 (PIMIV) or CRISPR3 (PIM23). Transformation was performed with a non-targeted control (pTRB30) or a 
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CRISPRl spacer. The AHAI2 strain, which contained no 
spacers targeting the plasmids, and the different PIMs were 
transformed with the targeted and non-targeted plasmids. 
The presence of a single additional spacer in CRISPRl or 
CRISPR2 was sufficient to reduce transformation efficiency 
by > 100-fold (the limit of detection) (Figure 3A). Surpris- 
ingly, the CRISPR3 spacer did not protect PIM23 from 
plasmid uptake (Figure 3A). A subtle difference in transfor- 
mation efficiency was observed between the control and tar- 
geted plasmids in the non-targeting AHAI2 control strain, 
which is likely due to differences in plasmid size. However, 
these minor differences cannot account for the level of plas- 
mid interference observed in the PIMs. 

Next, we tested if new spacers inhibited conjugation. A 
single spacer in either CRISPRl or CRISPR2 caused be- 
tween a 350- and 1 100-fold reduction in conjugation (Figure 
3B), providing the first evidence of inhibition of conjugation 
by Type I-F systems. In agreement with the transformation 
result, the CRISPR3 spacer provided no protection from 
conjugation (Figure 3B). Multiple spacer acquisitions were 
observed during priming (Figure 2); therefore, we used the 
conjugation assays to test if multiple spacers led to elevated 
plasmid interference. We tested PIMs containing different 
numbers of spacers in the active arrays (CRISPRl and 2) 
and that had protospacer targets with consensus PAMs (see 
below). Increasing the number of spacers further reduced 
the conjugation efficiency and the highest protection, of 
~3400-fold, was observed with nine new spacers (Figure 3C 
and Supplementary Figure S2). On average, across all PIMs 
tested, each spacer provided ~400-fold protection, but not 
all spacers provided equal protection. For example, PIMs 
with 4 or 5 spacers provided similar interference (Supple- 
mentary Figure S2), which might be due to factors such as 
altered crRNA abundance or stability, resulting in differ- 
ences in targeting. In summary, the new spacers acquired 
by the Type I-F system resulted in plasmid curing, inhib- 
ited both transformation and conjugation and additional 
spacers increased the level of protection. 



Protospacer distributions reveal biased spacer incorporation 

A characteristic feature of priming in the Type I-E system 
is that the majority of new spacers target the same strand 
as the original priming spacer (8,10,1 1). The 105 new spac- 
ers all mapped to the plasmid and represented 85 unique 
sequences (Figure 4 A and Supplementary Table S3). How- 
ever, the spacer locations within the different CRISPR ar- 
rays showed that they represent at least 100 independent 
incorporation events (i.e. they were not ancestors or sib- 
lings). The most common spacer length was 32 nt (89%) al- 
though 12/105 (11%) were 33 nt long. Surprisingly, 42% of 
new spacers were in the same orientation as the C2S6 prim- 
ing spacer (i.e. targeted the — strand) and 58% targeted the 
+ strand (Figure 4A; bar graph). Therefore, priming in the 
Type I-F system is strand-independent. 

Although the new spacers targeted both strands of the 
plasmid, the distribution of protospacers was biased. There 
was an increase in protospacers clustered near the primed 
protospacer (Figure 4A and C; bars) that cannot be ex- 
plained by the distribution of 793 GG PAMs over the plas- 
mid (the PAM 'landscape') (Figure 4B). For example, the 
majority of all protospacers were located in close proxim- 
ity to the primed protospacer and their frequency dimin- 
ished the further from the primed protospacer (Figure 4A 
and C; blue bars). The same trend was observed when using 
a shifting window size and when the data were corrected for 
PAM occurrence (Figure 4C; blue line). This biased distri- 
bution is consistent with a priming model, in which spacers 
targeting protospacers closer to the primed protospacer are 
more likely to be acquired than those targeting more distal 
locations. Another striking feature of the data was the ap- 
parent strand-specificity of protospacers relative to the lo- 
cation of the primed protospacer (Figure 4A and C). The 
presence of PAMs throughout the plasmid, indicates that 
an uneven PAM landscape cannot account for these strand- 
specific trends. Specifically, acquisition of new spacers com- 
plementary to the non-target strand (+ strand protospac- 
ers in this experiment), predominantly occurred close to the 
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primed protospacer, yet toward the 5' on the + strand (Fig- 
ure 4D; green). In contrast, incorporation of new spacers 
targeting the - strand (targeted relative to priming spacer) 
was more evenly distributed, occurring both 5' and 3' of, yet 
still clustering close to, the primed protospacer (Figure 4D; 
red). Therefore, the acquisition of new spacers during prim- 
ing in the Type I-F systems appears to proceed in a 3'-5' di- 
rection relative to the primed protospacer on the non-target 
strand and more diffusely in the vicinity of the primed pro- 
tospacer on the target strand. 

Protospacer orientation affects the bias in spacer incorpora- 
tion 

To test whether protospacer orientation influenced Type I-F 
priming, we developed a new system with the red fluorescent 
protein encoded by mCherry to visually detect plasmid loss 
and eliminate the need to screen antibiotic resistance. Plas- 
mids were created with no protospacer (negative control) or 
with - or + strand PAM-protospacers matching CRISPRl 
Spacer 1 (ClSl), but containing a -1 PAM mutation to 
trigger priming. Cultures of the WT harboring the no pro- 
tospacer negative control, or plasmids with - or + strand 
primed protospacer orientations, were passaged without se- 
lection for 5 days and plasmid loss scored (Supplementary 
Figure S3). Over 5 days there was minimal loss of the con- 
trol plasmid, while loss of the - and + strand primed plas- 
mids reached ~15% (Supplementary Figure S3B). 

The accelerated plasmid loss was due to CRISPR-Cas 
interference since spacer incorporation into the CRISPR 
arrays was detected and >120 new spacers for each plas- 
mid were sequenced (Supplementary Tables S4 and S5). 
The protospacer targets of the new spacers were mapped 
to both the - and + strand primed plasmids (Supplemen- 
tary Figure S4A and B), the PAM landscapes (Figure 5A 
and B), and the overall (Figure 5C and D) and strand- 
specific (Figure 5E and F) protospacer distributions were 
determined. Consistent with the eca0560 experiments, these 
assays demonstrated that spacer acquisition occurs more 
frequently near the primed protospacer, irrespective of the 
direction of the primed protospacer, and decreases further 
from the primed protospacer (Figure 5C and D). Interest- 
ingly, for the - strand primed plasmid, the data paralleled 
the earlier results with the eca0560 plasmid (also - strand 
primed), such that the acquisition of new spacers targeting 
the original non-target (+) strand, mainly occurred close to, 
yet 5' of the primed protospacer (Figure 4D versus Figure 
5E; green bars). Again, similar to the data in Figure 4, new 
spacers targeting the - strand (same target strand as origi- 
nal priming spacer) were more evenly distributed, occurring 
either side of, yet still close to, the primed protospacer (Fig- 
ure 5E; red). The opposite trends were detected when the 
protospacer was reversed (+ strand primed protospacer), 
with new protospacers on the original non-target (-) strand 
close to, but 5' of, the primed protospacer on the - strand 
(Figure 5F; red) and protospacers on the opposite strand 
on either side of the primed protospacer (Figure 5F; green). 
In summary, priming in the Type I-F system leads to ef- 
ficient acquisition of spacers targeting either strand of the 
invader. These spacers target close to the original primed 
protospacer, yet with an uneven strand distribution. 
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Priming demonstrates a GG PAM in Type I-F systems 

The Type I-F system of P. atrosepticum contains CRISPR-4 
(cluster 4) repeats (34), which were proposed in a bioinfor- 
matic study to possess a 5'-protospacer-GG-3' PAM (21) 
(Figure 6A). However, no study has analyzed Type I-F 
PAMs from a large number of experimental acquisition 
events. Therefore, we combined our data to determine the 
Type I-F PAM for new spacers incorporated through prim- 
ing (n = 351). Of the new spacers, 87% were 32 nt, 12% 
were 33 nt, <1% were 34 nt and one case of 31 nt was de- 
tected (Figure 6B). The 5' and 3' flanks of all protospac- 
ers were aligned and sequence logos generated (Figure 6C). 
No conservation was detected in the 5' flanking sequence 
or in the protospacer (Supplementary Figure S5), but a GG 
PAM was detected in the -1 and -2 positions of the 3' flank- 
ing sequence (i.e. a 5'-protospacer-GG-3' PAM). When pro- 
tospacer flanks were analyzed in separate groups based on 
length, the 32 and 33 nt groups displayed the same overall 
GG PAM (Figure 6D and Supplementary Figure S5A and 
B). 

Next, we examined protospacers with non-consensus 
PAMs (« = 28). When these were all aligned, GG PAMs 
were detected at the +1/-1 and -2/-3 positions in 24 of 28 



examples (i.e. shifted by ±1 nt relative to the original PAM 
location) (Figure 6E and Supplementary Figure S5C and 
D). In E. coli these were recently termed 'slippage' events 
(14). The size distribution of these non-PAM spacers was 
overrepresented for 33 nt (25% compared with 12%) (Figure 
6E versus D). When the 33 nt protospacers lacking a con- 
sensus -1/-2 PAM {n — 7) were analyzed, six contained a 
+ 1/-1 GG (Figure 6E and Supplementary Figure S5D). In- 
terestingly, this results in protospacers which are 32 nt from 
the 'correct' -1 /-2 PAM location. In conclusion, our data 
demonstrates that during priming in the I-F system, spacers 
are preferentially selected that match protospacers with a 5'- 
protospacer-GG-3' PAM and infrequently, spacers match 
protospacers with PAMs shifted by ±1 nt. 



DISCUSSION 

Our results have revealed that CRISPR adaptation via 
priming occurs in the Type I-F CRISPR-Cas system. When 
pre-existing spacers in the wild-type CRISPRs match pro- 
tospacers in invading plasmids that lack a consensus PAM, 
plasmid removal occurs and is accompanied by acquisition 
of new spacers into all three chromosomal CRISPR arrays. 
The new spacers interfere with transformation and conjuga- 
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tion and inhibition increases with more spacers. The spac- 
ers target both strands of the invader and these protospac- 
ers are clustered close to the initial primed protospacer. A 
strand-independent bias for new spacers occurs that sug- 
gests a model of priming that differs from that proposed for 
the Type I-E system. 

Priming in the Type I-E system has only been observed 
upon artificial overexpression of the cas operon, either by 
deletion of the hns repressor (11,15) or through the use of 
inducible cas gene expression (8,10). In contrast, we have 
shown that the native expression of the Type I-F cas genes is 
sufficient to provoke a priming response with nascent spac- 
ers incorporated into all three chromosomal CRISPRs. Re- 
markably, the number of acquisition events per CRISPR 
array closely correlated with the original number of exist- 
ing spacers in these three loci, providing evidence that the 
length of an array can be an indicator of its incorporation 
activity. The Type I-F system of P. atrosepticum appears the 
most active characterized system for priming, with up to 
nine new spacers acquired in a single strain. In the I-E sys- 



tem of E. coli a maximum of five or six new spacers has been 
detected (8,11). 

The newly incorporated spacers interfered with the ac- 
quisition of plasmids via either transformation or conjuga- 
tion. Type I-F systems have previously been shown to in- 
hibit growth upon chromosomal self-targeting (32), phage 
infection (18,35) and plasmid transformation (36), but con- 
jugation has not been examined. Surprisingly, although the 
new spacers in CRISPR 1 and CRISPR2 impeded plas- 
mid uptake, CRISPR3 was inactive for interference, despite 
acquiring new spacers. All three CRISPR arrays are ex- 
pressed and processed by Cas6f into ~60 nt crRNAs; how- 
ever, CRISPR3 is the most weakly expressed (28). Con- 
sistent with the interference deficit of CRISPR3, no PIMs 
were isolated that contained new spacers only in CRISPR3; 
any incorporations were accompanied by integrations in 
CRISPRl and/or CRISPR2. Whether low expression or 
sequence deviation within the CRISPR3 repeats account 
for the lack of targeting requires further investigation. For 
CRISPRl and CRISPR2, the level of protection against 
conjugation was dependent on the number of spacers, with 
increased protection with up to nine new spacers, indepen- 
dent of whether the leading or lagging strand was targeted 
(37). This is consistent with the proposal that priming al- 
lows not only a rapid response to 'escape' mutants, but that 
by acquiring multiple spacers the response is strengthened 
and would reduce the possibility of further escape (11). 

By analyzing ~350 newly acquired spacers, we have been 
able to experimentally define the PAM used in priming 
adaptation in the Type I-F system (5'-protospacer-GG-3'). 
This is identical to the PAM predicted computationally by 
Mojica et al. (21) and matches the PAM observed in a 
few spacers incorporated in P. aeruginosa (18). This GG 
PAM is consistent with those shown previously to be in- 
volved in productive interference in the Type I-F systems 
(18,21,32,36). Interestingly, less than 10% of new spacers 
had targets that lacked a GG at the consensus -1 /-2 po- 
sition. Most of these contained a GG PAM shifted by ±1 
nt, such that GG was at either the +1/-1 or the -2/-3 po- 
sition. Whether the +1/-1 PAMs enable interference is not 
known, whereas the -2/-3 PAM 'slippage' events result in 
a G at position -3, which is sufficient for interference in an 
E. co// Type I-F system (36). This suggests that the Type I-F 
system can make 'errors' during spacer selection relative to 
the PAM location; however, some of these spacers may still 
be functional for interference. 

Priming in the Type I-F system results in the incorpora- 
tion of new spacers that target either strand of the invader. 
Indeed, across three independent experiments there was an 
equal number of new spacers targeting either DNA strand 
(Figure 7A). Despite this equal strand distribution, there is 
a bias in the target locations of the newly acquired spac- 
ers, with an obvious enrichment for protospacers close to 
the location of the initial primed protospacer. Furthermore, 
the strand of the primed protospacer influences the loca- 
tion and strand targeted by new spacers. Specifically, the 
priming spacer is complementary to the primed protospacer 
(the target strand) and new protospacers on the non-target 
strand (relative to original priming event) are preferentially 
located 5' (~43%) compared with 3' (<8%) of the primed 
protospacer (Figure 7B). On the target strand, new proto- 
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spacers are also more commonly located 5' of the primed 
protospacer (~30%) compared with 3' (~20%), but this dis- 
tribution is more even than the non-target strand bias (Fig- 
ure 7B). Overall, on both the target and non-target strands 
there is a strong protospacer location bias with ~75% 5' and 
only ~25% 3' relative to the original, primed protospacer 
(Figure 7C). 

Based on our data and previous studies we propose a 
working model for priming in the Type I-F CRISPR-Cas 
systems (Figure 8). First, a Csy complex forms that con- 
tains a crRNA with the priming spacer that guides the com- 
plex to the invader dsDNA (26,29). The lack of a con- 
sensus PAM (and possibly other protospacer mismatches 
(15)) is likely to result in weak/infrequent binding of the 
CsyxrRN A complex to the protospacer target strand in the 
invader and causes the formation of an R loop with the 
non-target strand displaced (38). After R loop formation, 
the native Cas2-3 hybrid protein, in an adaptation complex 
with Casl (26), is recruited to the displaced ssDNA non- 
target strand, potentially via Csyl (26,39-41). Cas3 proteins 
possess adenosine triphosphate-dependent helicase activ- 
ity and unwind dsDNA in a 3'-5' direction, while the HD 
nuclease domain cuts the translocating strand (41^3). We 
propose that the interaction between Casl and Cas2-3 re- 
sults in a translocating adaptation complex that, upon en- 
countering a GG PAM, generates a new spacer and inte- 
grates this into one of the CRISPR arrays. This could ac- 
count for the preferential location of protospacers 5' of the 
primed protospacer on the non-target strand. Whether this 
new spacer is derived from dsDNA or from one or the other 
strand is not known. We hypothesize that the helicase activ- 
ity and movement of the Casl:Cas2-3 complex along the 
non-target strand in the 3'-5' direction results in the dis- 
placement of the target strand, and this ssDNA might aid 
the recruitment of Casl :Cas2-3 complexes that will translo- 
cate in the 3'-5' direction on the target strand (41). Since 
the target strand begins to be exposed as ssDNA 3' of the 
primed protospacer, the acquisition of new spacers that tar- 
get this strand initiates 3' of the primed protospacer and 
proceeds in the 3'-5' direction. This would result in the tar- 
get strand protospacer distribution we observe (Figures 7B 
and 8D), where protospacers are present on either side of 
the primed location, but still with a preference towards the 
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5' on the target strand. The molecular details of this model 
will require testing in future studies. 

This model for priming in the Type I-F CRISPR-Cas sys- 
tem is in contrast to the I-E system in E. coli, where prim- 
ing results in the preferential acquisition of new spacers that 
target the same strand as the original priming spacer (8,11). 
In addition, protospacer clustering is not observed in the 
Type I-E system, where there was no apparent trend in the 
target location of new spacers during priming when small 
plasmids with a limited number of PAMs were analyzed 
(10). Therefore, despite being evolutionary related (7), the 
mechanism of priming in the Type I-F system is clearly dif- 
ferent from the Type I-E system. A recent study showed 
that priming occurs in the Type I-B system of H. hispan- 
ica (20,25). Spacers were acquired that target either strand, 
but there was an apparent strand bias on either side of the 
primed protospacer. It was proposed that following Cas- 
cade binding and R-loop formation, Cas3 is recruited to 
the displaced non-target strand and nicks it. Next, in a pos- 
sible Cascade replacement process, Cas3 was hypothesized 
to bind the target strand near the primed protospacer. The 
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movement of Cas3 along the ssDNA in a 3'-5' direction on 
both strands from the primed protospacer might account 
for the biased distribution of new protospacers. The adapta- 
tion data for the Type I-F system is therefore more similar to 
the Type I-B than to the I-E system. The higher resolution 
Type I-F data and our protospacer reversal experiments re- 
vealed spacer acquisition trends that have led us to a related, 
yet distinct model to the one proposed by Li et al. (20). Cru- 
cially, we provide data that the frequency of spacer acqui- 
sition decreases with increasing distance from the primed 
protospacer, which supports a Casl:Cas2-3 translocation 
model where priming proximal PAMs are more likely to 
be substrates for new spacers (Figure 8). Moreover, the 
different spacer distributions that we detect on the target 
versus non-target strands suggest that the recruitment of 
Casl:Cas2-3 to the target strand occurs at multiple posi- 
tions following displacement of the target strand as Cas2-3 
translocates along the non-target strand. Further work is 
required to determine if the Type I-B and I-F systems use 
the same mechanism for priming, or if they are subtly dif- 
ferent. The unique Cas2-3 fusion in the Type I-F systems 
might indicate that priming proceeds in a different manner. 
In summary, the general phenomenon of priming adapta- 
tion is conserved between CRISPR-Cas subtypes but sig- 
nificant differences exist. The exact molecular mechanisms 
underlying these processes now need to be addressed. 

SUPPLEMENTARY DATA 

Supplementary data are available at NAR Online. 
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